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INTRODUCTION 


Consider  a  large  noisy  room  in  which  sound  can  reverberate.  In  one  end  o(  the 
room  a  speaker  radiates  a  narrow  band  signal.  In  a  remote  part  of  the  room  a  sensor 
array  records  the  sound  pressure  at  each  sensor  location  and  attempts  to  detect  the  sig¬ 
nal.  Let  k  denote  the  number  of  sensors.  Locally,  the  signal  can  cross  the  array  in  a 
small  number  of  modes  of  travel.  Let  m  denote  the  number  of  such  modes.  We  shall 
assume  that  m<  k.  Each  mode  may  be  associated  with  a  separate  acoustic  path.  In  the 
simplest  case,  each  mode  may  be  represented  by  the  time  delays  associated  with  a 
plane  wave  arrival.  However,  this  assumption  is  not  necessary  for  this  discussion.  All 
that  will  be  assumed  is  that  each  mode  is  characterized  by  a  rank-1  covariance  matrix 
and  that  the  vector  factor  of  each  of  these  rank-1  matrices  is  known.  Let  v,  denote 
the  vector  factor  for  the  i  th  mode. 

Note  that  since  the  signal  is  assumed  to  be  narrow  band,  it  is  convenient  to  for¬ 
mulate  the  problem  in  the  frequency  domain.  This  means  that  all  of  the  pressures  dis¬ 
cussed  will  be  represented  by  complex  numbers. 

The  signal  will  now  be  characterized  by  the  amplitude  coefficients  associated  with 
the  modes.  The  signal  field  at  any  time  is  the  sum  of  the  products  of  each  mode  and 
its  associated  amplitude.  Let  ex,  denote  the  amplitude  of  the  i  th  component.  Then 
the  total  signal  field  can  be  described  by  a  vector,  5,  whose  components  arc  the  total 
signal  pressures  at  each  of  the  sensor  positions. 

s  =  aiv,+  a2v2+  •  ■  ■  +amvn  O) 

Let  V  denote  a  matrix  whose  column  vectors  arc  the  vector  factors. 

V  =  [  v,  v2  ■  •  ■  vm  ]  (2) 


and 


a, 

a2 


so  that 

s  =  V  a  (4) 

So  the  number  of  rows  in  V'  is  k,  the  number  of  elements  in  the  array,  and  the  number 
of  columns  in  V  is  m .  a  is  an  m  element  complex  vector. 

It  will  also  be  convenient  to  define  a  covariance  matrix  for  the  complex  coefficients,  a,. 
Let  <  >  denote  an  ensemble  average.  Then  let 


k 


m 


A  s  <  aa"  >  (5) 

Under  certain  assumptions,  the  signal  covariance  matrix  will  still  be  of  rank-1.  For 
example,  we  might  require  that  the  source  not  move,  that  none  of  the  reflecting  surfaces 
move,  that  no  air  currents  disturb  the  sound  path,  etc.  Such  assumptions  arc  too  stringent 
for  the  purposes  of  this  discussion.  Instead,  we  shall  assume  that  the  sound  paths  vary 
enough,  or  at  least  involve  enough  unknown  parameters,  that  we  must  attempt  to  estimate 
the  a's ,  or  A  .  We  shall  assume  that  the  rank  of  A  is  m. 

The  novelty  in  this  paper  will  be  the  rank-m  signal.  The  conceptual  problems  arc 
considerably  more  difficult  than  for  the  rank-1  case.  Different  criteria  lead  to  different 
detector  designs,  and  the  connection  between  the  designs  is  not  obvious.  The  first 
viewpoint  which  we  use  will  ignore  the  detection  problem  and  concentrate  on  the  estima¬ 
tion  problem. 


MAXIMUM  LIKELIHOOD  ESTIMATION  OF  a 

Maximum  likelihood  estimation  theory  seems  to  provide  the  simplest  path  10  cxiend 
existing  array  processing  theory  for  this  multiple-parameter  signal  problem.  We  shall 
assume  Gaussian  noise.  Let  x  denote  a  column  vector  of  complex  pressures  from  each 
sensor.  Let  C  denote  the  noise-only  covariance  matrix.  Then  for  any  received  pressure 
field .  x ,  the  problem  is  to  choose  a  to  minimize 


(x  -  1'  a)liC~l(x  -  V  a) 


This  can  be  expanded  as 

xhC~'x  -  x"C~]V  a  -  aHV"C~lx  +  a"v"C~'V  a 

* 

We  can  rearrange  the  terms  to  get 

x"c~xx  -  x" c~'v  (v" c~]vyW" C~] x 

+  (a-(V"  C-'vy'v"  C~{x)"  (V"  C~lV)  {a-(V"  C~'vy]v"  C~]x) 
Obviously,  the  minimum  of  this  quantity  occurs  when 

a0  =  (v"  c-'vylv"  C~'x 

and  the  minimum  is 

x"c~]x  -  x"c~]v  (v" c~'vy'v" C~]x 


It  is  appropriate  to  define  an  estimation  matrix  operator  as 

e  =  (  v"c~'v  y'v"c~] 


(6) 


(7) 


(8) 


(9) 

(10) 


(11) 


2 


m 

via 


»ii 


This  operator  warrants  further  examination.  The  reader  may  note  that 
V(V"C~l V  ylVHC~i  is  the  projection  operator  for  V7  when  the  metric  matrix  is  C~\  This 
association  with  projection  operators  provides  a  nice  geometric  interpretation  of  the  pro¬ 
cessing  algorithms. 


Note  that  £  should  be  used  with  caution  when  any  of  the  v,  become  nearly  equal.  In 
this  case,  the  matrix  V7"  C-1!7  becomes  nearly  singular  and  some  undesirable  noise 
amplification  effects  can  occur.  This  subject  is  still  under  investigation. 


Of  course,  the  proper  extension  of  conventional  beam -forming  can  be  found  by  set¬ 
ting  C  to  the  identity  matrix. 


It  is  probably  best  not  to  refer  to  the  processing  scheme  described  above  as  bcam- 
forming.  All  current  beam-forming  theory  assumes  that  the  estimation  operator  is  a  sim¬ 
ple  column  vector.  The  fact  that  £  is  a  matrix  complicates  the  processing  somewhat. 


ESTIMATING  THE  ESTIMATOR 


Assuming  that  the  estimator  matrix,  £,  is  to  be  used,  it  must  first  be  computed  from 
C  and  V .  But  if  C  is  to  be  estimated  from  data,  it  may  be  difficult  to  obtain  noise-only 
data  from  which  to  compute  C.  Let  us  assume  that,  unbeknown  to  the  operator,  a  signal 
is  present  to  contaminate  the  estimation  of  C .  Let  us  examine  the  error  which  will  result 
from  his  attempt  to  evaluate  Eq.  11.  He  will  actually  compute 


£s+n=  (T"(C  +  V7  A  v"y'vy'v"  (C  +  V7  /t  V") 


We  must  examine  what  happens  when  C  +  V  A  V"  is  substituted  for  C.  Note  that  from 
Woodbury's  formula 


(  C  +  V  A  V"yl  =  c~x-  c-1!/  T  V11  c~ 


where 


r-'=  a~]  +  f"rV 


Note  that,  among  other  things,  this  means  that 


(v" (C  +  v  a  v7"r1v,r1  =  (v"c~'v  -  v" c~'v  t  v" c~xvyx 


=  (vH  c~lvy[-u 


where 


T~'+Lr'=(  VhC~xV  ) 


(v"(c  +  v  a  v"y'vyl  =  (  i'"r'r  y'  +  a 


(18) 


Wc  can  use  this  to  compute  the  effect  of  the  presence  of  signal. 

£s+s=  ( (  v"c-]v  r'+4)  y"(  c-'-c  'v  r  v"c~v)  (19) 

=  (  v"c~'v  y'v"C~'-T  v"C~]+A  V"  C~'-A  V"  C~'V  T  V"  C~x 

=  (  V"C-'V  y'v"C-'-(  T  -  A  +  A  (V"C'lV)  T  )  v"  c~] 

=  (  v"c~'v  )-'V',,C'1-(  T  -  A  +  A  (  T-'-A~x)  T  )  V11  C~] 

=  (  v" c'xv  yxv"c~x=  e 

In  other  words,  no  error  results.  The  formula  is  invariant  with  respect  to  the  arrival  of  the 
signal. 


POWER  ESTIMATION 

The  estimate  of  the  signal  pressure  in  the  sensors  is 

s^=VEx  (20) 

From  this,  wc  can  estimate  the  signal-like  energy  as 

<  (VEx)"(VEx)>  =<  x"  E"  V"  V  E  x>  (21) 

=  <  tr(E"v"V  E  x  x"  )>  =  =  <  tr{EnV"V  E  C  )>  =  tr(  V"  V  E  C  EH) 

Note  that,  because  of  the  result  in  the  previous  section,  wc  can  be  ambiguous  about 
whether  E  was  estimated  from  the  true  noise-  only  covariance  matrix  or  from  the  current 
data  matrix.  So 


<  4',rcst>  =  lr(  (V"V)  (22) 

This  expression  is  evidently  the  proper  extension  of  the  popular  adaptive  bearing 
response  algorithm,  sometimes  referred  to  as  the  "maximum  likelihood"  or  "minimum 
variance  distortionless  look”  algorithm. 


Of  course,  we  can  also  easily  extend  the  conventional  beam-forming  algorithm.  In 
this  ease 

£conv=  (  v  "V'  TV"  (23) 

So 

tr(  (VHV  )  EconvC  )  =  tr(  (V"C  V  )  )-')  (24) 

This  is  not,  however,  the  optimum  detection  structure.  There  arc  some  other  con¬ 
ceptual  issues  to  be  discussed  below. 

LIKELIHOOD  RATIO  DETECTION 

The  structure  for  the  likelihood  ratio  detector  is  easy  to  derive.  We  need  simply  to 
look  at  the  exponent  in  the  ratio  of  the  Gaussian  probability  functions. 

-x"(  C  +  V  A  V"yxx  +  x"C-'x  =  x"C-'V  T  V"  C~xx  (25) 

If  several  samples  of  a:  are  to  be  averaged,  this  may  be  more  conveniently  written  as 

tr(  C~lV  T  VHC-'<  x  x">  )  (26) 

Note  that  in  this  case,  one  must  evidently  have  noise-only  information  from  which 
to  estimate  C . 

In  fact,  the  form  of  the  likelihood  ratio  detector  suggests  theoretical  problems  which 
are  considerably  more  difficult  than  one  might  guess.  The  probe  variable,  Eq.  26,  docs  not 
behave  ^cording  to  simp'e  gamma  distribution  which  we  expect  for  square-law  detec¬ 
tors.  It  is  formed  from  a  sum  of  exponential  variables  with  different  variances.  This,  as 
will  be  seen  below,  complicates  the  conceptual  picture. 


PERFORMANCE  INDICES 

In  the  previous  section,  we  saw  that  the  optimum  detector  takes  the  form  of  a  qua¬ 
dratic  inner  product.  For  discussion  of  detector  performance  it  is  convenient  to  generalize 
this  algorithm.  We  shall  define  a  variable 

q  —  <  xH  W  x  >  =  lr(  W  <  x  xH  >  )  (27) 

where  W  is  a  general  square  matrix.  This  function  can  be  regarded  as  a  generalization  of 
the  beam -forming  process.  It  will  result  in  a  scalar  variable,  q.  We  must  now  define  a 
signal-to-noisc-ratio  function  to  describe  the  statistics  of  q.  The  first  step  is  to  examine  the 
moments  of  q. 


t 

) 


3L 


<  n  >  s*N=  tr(  (C  +  V  4  V")  If  )  (29) 

Here  we  encounter  a  subtle  conceptual  trap.  It  is  tempting  to  use  these  two  formulas  to 

define  a  signal-to-noise  ratio.  This  approach  depends  on  the  standard  deviation  ol  the  vari¬ 
able  being  proportional  to  the  mean  value  of  the  variable.  However,  since  t|  is  a  sum  ol 
unequal  exponential  variables,  this  will  not  give  a  good  indicator  of  the  standard  deviation 
of  q.  We  must  compute  the  variance  more  carefully. 

vi iruince  -  <  (  x"  H'  x  r>  N  <  q  >  y  (30) 

This  requires  us  to  evaluate  fourth  order  moments  for  complex  Gaussian  variables.  It  is 

well  known  that  if  4:-  4:-  4^  and  4^  are  real  Gaussian  random  variables,  then 

<  4;4:4^4j>  =  <  4s4:>  <  4^4j>  +  <  4:4'>  <  4:>4-i>  *  <  4;4-i>  <  4:4r>  ■  However,  the 

corresponding  result  for  complex  Gaussian  variables  is  not  widely  known.  It  is 

<  4*4-’4sC.J>  -  <  c*2:>  <  4i2.i>  •  <  c"2:>  <  2"2.i>  (31) 


With  this  knowledge  and  some  index  arithmetic  it  can  be  seen  that 


variance  ir(  W  1 C  If  C  ) 


We  are  now  in  a  position  to  define  a  signal-to-noise  ratio  (snr). 


tr(  f  ,3  \  H  If  )  ir(  \\’"V  A  \  11  ) 
tr(  If  "  C  If  C  ) 


This  is  easy  to  compute  for  the  likelihood  ratio  detector. 


i  w)[-K  = 


( tr(C'-'V  7  V"  C~‘f  ,3  V"  )  r 
tr(C~  l  7  \  "C-'C  f-'V  7  V"C~'C  ) 


(snr)t K- 


( tr(  (V"C-'V  )  T  (  V  C~  'f  )A  ))‘ 
tr((V"C-'V  )  T  (V"C~]V  )  /■  ) 


MAXIMIZING  SIGNAL-TO-NOISE  RATIO 

Curiously,  there  is  reason  to  believe  that  even  Eq.  33  docs  not  tell  the  whole  story. 
To  sec  this,  consider  how  W  might  be  chosen  to  maximize  the  signal-to-noise  ratio.  Wc 
can  use  differential  calculus,  or  the  Schwarz,  inequality  to  maximize  the  signal-to-noise  ratio 
with  respect  to  Vf  . 

VfSNR  =  C-'V  A  V"C-'  (36) 

This  produces  a  squared  signal-to-noise  ratio  of 


'-’VW.VJVLVAV! 


(w)‘„  -  tr (  <r"C  1 I'M  {V"C  '\')A  )  (37) 

II  is  not  entirely  clear  why  this  detector  would  not  he  as  good  as  the  likelihood  ratio 
detector.  It  must  be  due  to  the  peculiar  behavior  of  the  tail  end  of  the  distribution.  This 
illustrates  the  difficulties  of  dealing  with  sums  of  unequal  exponential  variables. 

Note  that  Eq.  37,  like  Eq.  22  or  Eq.  24,  could  be  used  as  a  probe  function.  How¬ 
ever,  the  use  of  fourth-order  moments  is  not  generally  desirable. 

ESTIMATION  OF  A 

Minimizing  the  right  side  of  Eq.  37  with  respect  to  A  can  yield  an  estimate  of  A  . 
Again,  the  idea  is  that  the  noise  which  looks  most  like  the  signal  will  be  that  which  is  most 
difficult  to  process  against.  Of  course,  the  trace  of  A  must  be  constrained,  so  we  have 
merely  to  m inim i/e 

in  (VhC-'V)A  (V"C-lY)A  )  {  ^  } 

( ir(A  ))2 

We  can  minimize  this  with  respect  to  A  .  The  result  is 

a  =  (v"c~'v  r2  ( 39i 

However,  it  is  not  clear  that  this  is  a  good  estimator.  Once  again,  the  appearance  of 
fourth-order  moments  may  cause  problems. 

ANOTHER  FACTORIZATION 

In  all  of  the  above  discussion,  the  signal  covariance  matrix  has  been  expressed  as  a 
product  of  three  factors.  That  is.  the  signal  covariance  matrix  has  been  written  as  V  A  V" . 
This  factorization  was  chosen  because  it  lends  itself  to  physical  interpretation.  However, 
from  a  mathematical  viewpoint,  the  factorization  is  not  unique.  For  some  purposes,  we 
can  find  a  simpler  factorization.  If  we  are  willing  to  give  up  the  obvious  physical  interpre¬ 
tation,  we  can  choose  T  so  that  A  is  an  identity  matrix  and  (V"C~'V)  is  diagonal.  In  other 
words,  we  can  assume  without  loss  of  generality  that 


‘V  =  D 


where  /  is  the  identity  matrix  and  D  is  a  diagonal  matrix.  These  substitutions  in  the  above 
equations  provide  an  apparent  simplification.  These  simplifications  may  help  provide  an 
intuitive  understanding  for  readers  who  are  comfortable  with  the  geometric  implications  of 
Eq.  40. 

CONCLUSIONS 

This  paper  extends  the  class  of  signals  we  can  address  with  established  array  signal 
detcction/cstimation  ideas.  It  docs  not  provide  a  logical  alternative  to  familiar  array  pro¬ 
cessing  techniques.  Rather,  it  allows  extension  of  these  signal  processing  idea*  to  a  class  of 
signal  which  is  often  present  but  often  ignored.  It  provides  a  way  to  deal  with  problems  in 


mmi 


which  multiple  signals  characterize  a  single  hypothesis. 

Probably  the  most  important  idea  developed  above  is  the  estimator  matrix,  l  .  as 
described  in  let]  II.  The  utility  of  this  estimator  depends  on  Eq.  IP.  That  is,  we  can  esti¬ 
mate  /.  without  knowing  whether  a  signal  was  present  in  ihe  data  sampled. 

The  utility  of  probe  or  estimation  functions  such  as  Fit] .  22,  24,  37,  and  39  is  not  yet 
clear.  The  answer  will  probably  depend  on  details  of  the  actual  applications.  It  is  impor¬ 
tant  to  carefully  investigate  the  a  prion  information  about  the  signal  parameter  matrix,  A. 
and  the  required  search  space. 

Equation  33  can  probably  give  a  reasonable  estimate  of  detection  performance  in 
most  cases.  However,  since  the  noise  is  a  sum  of  unequal  variance  exnonential  variables, 
it  may  not  always  give  accurate  results.  Equation  33  should  be  useful  if  applied  with  cau¬ 
tion  and  insight.  Any  analysis  which  ignores  Eq.  33  will  probably  be  wrong. 

In  general,  if  the  problem  involves  a  single  signal  which  lits  the  usual  beam -forming 
assumptions,  the  ideas  presented  above  are  not  needed.  If  the  signal  breaks  up  into  com¬ 
ponents  which  may  not  be  entirely  coherent,  the  ideas  developed  above  may  come  into 


Appendix 

FSTIMATION  MATRIX  AND  CONSTRAINED  OPTIMAL  BEAM-FORMIN' (» 


The  estimation  matrix,  £,  can  play  a  role  in  beam -forming.  It  occurs  in  the  structure 
of  optimal  beam -forming  with  hard  constraints.  Since  the  derivation  seems  not  to  be  avail¬ 
able  in  the  open  literature,  we  shall  examine  it  here. 

The  basic  problem  may  be  stated  as  follows.  Design  a  beam -former  which  has 
minimum  noise  output  subject  to  the  constraint  of  specified  responses  in  certain  directions. 
Let  2  denote  a  vector  of  beam  -form  ing  weights.  Let  V  denote  a  m  atrix  whose  colum  ns  arc 
the  models  for  signals  from  the  constraint  directions.  Let  h  denote  a  column  vector  of 
constraint  values  for  each  direction.  The  the  constraint  can  be  written  as 

VHz  =  l>  (  A.  1  j 

This  type  of  constraint  may  be  used  in  several  ways.  If  the  constraint  directions  are 
close  together,  it  could  be  used  for  main-lobe  maintenance  or  broadening.  The  beam 
could  be  made  to  have  main-lobes  in  two  or  more  different  directions.  The  beam  could 
also  be  made  to  have  nulls  in  certain  directions  by  setting  the  appropriate  components  of  h 
to  zero.  For  example,  if  v,  was  the  desired  look  direction,  main-lobe  maintenance  could 
be  accomplished  by  setting  the  other  v’s  to  directions  about  v,  and  computing  b  from 

r"v.  =  b. 

We  shall  assume  that  the  number  of  constraint  directions  is  less  than  the  number  of 
sensors. 

The  optimization  process  follows  standard  paths.  First,  we  define  a  quantity  to  be 
minimized.  Let  C  denote  the  noise  covariance  matrix. 

p  =  z"c  z  +  X(V"z  -  b)"(V"z  -  b)  (A. 2) 

We  shall  define  z  as  the  vector  which  minimizes  p  and  find  the  limit  as  X  goes  to  infinity. 
To  simplify  the  notation,  it  is  convenient  to  define  the  matrix 


M  =  C  +  XV  V" 


( A.3) 


Then  we  can  expand  the  expression  for  p. 

\i=z"  (C  +  XV  v"  )z  -  Xz"VB  -  Xh"v"z  +  Xb"  h  (A. 4) 

=  (M  z  -  XV  b)"  JT'(M  z  -  XV  b)  +  Xb"  (I  -  XV"  M~'\')b 

Since  the  first  term  is  the  only  one  which  depends  on  z,  it  is  sufficient  to  minimize 
it.  This  means  that 


(A. 5) 


z  =  X(C  +  XV  V"y'V  b  =  (  -f  C+V  V'y'V  b 

A 

Since  V  \  "  is  singular,  more  manipulation  is  required  before  it  is  clear  how  to  take  the 
limit  as  X  goes  to  infinity.  Again,  Woodbury’s  theorem  helps. 

z  =  (XC~'~  X2C~'V  (/  +  XV"  C~'vy'v"  C~x)  V  b  (A. 6) 

=  c~'v  (  A/  +  V"C~]V  y'b 

A 

Now  we  can  take  the  limit  as  X  goes  to  infinity. 

z  =  C~'V  (  V"C~'V  y'b  =  E"  b  (A. 7) 

In  other  words,  the  optimal  solution  consists  of  first  applying  the  estimation  opera¬ 
tor,  E ,  and  then  simply  multiplying  by  the  constraint  values  in  b. 


